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I Crystal Structure of the 20S 
| Proteasome from the Archaeon 
§T. acidophilum at 3.4 A Resolution 

§;•: Jan Lowe, Daniela Stock, Bing Jap * Peter Zwickl, 

Wolfgang Baumeister, Robert Hubert 

;;|: The three-dimensional structure of the proteasome from the archaebacterium Thermo- 
§r plasma addophifom has been elucidated by x-ray crystallographic analysis by means of 
i£ isornorphous replacement and cyclic averaging. The atomic model was built and refined 
p to a crystallographic R factor of 22.1 percent. The 673-kilodalton protease complex 
j|v. consists of 14 copies of two different subunits, a and p, forming a barrel-shaped structure 
|v of four stacked rings. The two inner rings consist of seven p subunits each, and the two 
Ik outer rings consist of seven a subunits each. A narrow channel controls access to the 
|; three inner compartments. The a^p?*? subunit assembly has 72-point group symmetry. 
|: The structures of the a and 0 subunits are similar, consisting of a core of two antipataliel 
p sheets that is flanked by a helices on both sides. The binding of a peptide aldehyde 
|; inhibitor marks the active site in the central cavity at the amino termini of the 3 subunits 
i£ and suggests a novel proteolytic mechanism. 
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The proteasome is now widely recognized 
as the central enzyme of nonlysosomal pro- 
tein defoliation. It is involved in the deg- 
radation of misfolded proteins (i ) as well as 
in the degradation and processing of sKort 
lived regulatory proteins (2). Peptide alde- 
hyde inhibitors of the proteasome have been, 
shown to block the degradation of most 
cellular proteins, including long-lived pro- 
teins and the generation of peptides present- 
ed on major histocompatibility (MHC) class 
1 molecules in vivo (3). Thus the life-span of 
many cellular proteins seems to lie con- 
trolled by this proteolytic system. These 20S 
proteasomes occur in the cytosol and in the 
nuclei of all eukaryotic cells examined so far, 
and they l\ave also been found in the ar- 
cliaebactcrium Thermopiasma acidophilum 
(4). At least some of these functions are 
linked to a ubiquitin and adenosine Diphos- 
phate (ATP)-dependent pathway of pro- 
tein degradation involving the 26$ protea- 
some, an exceptionally large complex (M r ~ 
2.0 MD) (5). The 20S proteasome, or mul- 
ticatalytic protease complex, forms the cat- 
alytic core of the 26$ complex, whereas 19S 
"caps," which associate with the termini of 
this core have regulatory functions (6). The 
26S proteasome differs in its enzymatic 
properties from the 20S particle. The ATP- 
dependent degradation of ubiquitinated and 
of certain uonubiquitinated proteins (7) is 
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linked to the 26S complex, whereas the 20S 
proteasome only degrades unfolded proreins 
in an energy-independent manner (8). Elec- 
tron microscopic studies have shown that 
the quartemary structure of the. 20S com- 
plexes is conserved from Th&maplasma to 
humans (9). The cylindrical or barrel- 
shaped bodies, with dimensions of approxi- 
mately 1 50 A in height and 1 10 A in diam- 
eter, show a characteristic pattern of four 
stacked rings of subunits perpendicular to 
the cylinder axis. The proteasome of the 
archaebacterium T. aadophiiiOTi contains 
only two different subunits, a and p (25.8 
kD and 223 kD, respectively), whereas pro- 
teasomes from eukaryotes have a more com- 
plex subutiit pattern, consisting of multiple 
at and P type subunits related to either a or 
P subunits of the Thcrmoplasma proteasome. 
The Thermopiasma proteasome has there- 
fore been referred to as an "urproteasome" 
(10). The a and p subunits of the Thermo- 
ptasma proteasome exhibit sequence homol- 
ogy, indicating that they evolved from a 
common ancestral gene. The hallmark of 
the a-type subunits is an NH 2 -terminal ex- 
tension of about 35 amino acid residues not 
present in the p-type suhunits. In spite of 
numerous efforts to characterize the nature 
of the protease activity, its mechanism has 
remained unclear. On the basis of studies 
with small synthetic peptides and protease 
inhibitors, up to five distinct proteolytic 
activities have been assigned to the eukary- 
otic proteasome ( J I ). 

The aforementioned studies have not 
allowed unequivocal classification of the 
enzyme as a serine, aspartate or thiol pro- 
tease, and sequence analysis of proteasome 
subunits has not revealed similarities with 
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any other known protease (JO). Site-direct- 
ed mutagenesis experiments with recombi- 
nant Thermopiasma proteasomes exclude 
the possibility that this proteasome is an 
unusual cysteine- or serine-type protease 
(12) as suggested by inhibitor studies. Our 
structural studies described Mow and the 
site directed mutagenesis experiments, de- 
scribed by Seemullcr et at. (13) show that 
the proteasome represents a novel type of 
protease. Degradation studies performed 
wirh the Thermopiasma proteasome have 
shown diat the substrate proteins are 
cleaved rather unspecifically. Nevertheless, 
the degradation products fall into a narrow 
size range, wirh most of rhe fragments hav- 
ing from six to nine amino acid residues, 
suggesting that some kind of molecular ruler 
exists (14). This is of particular relevance in 
view of the role eukaryotic proteasomes 
have in antigen presentation (15). 

Structure determination. We have de- 
termined the crystal structure from the 20S 
proteasome of T. aridciphikcm at 3.4 A res- 
olution. The recombinant protein was pu- 
rified from Escherichia coli as described by 
Zwickl et al. (16) and crystallized as de- 
scribed (17). Briefly, a solution of protea- 
somes (7 mg/ml) in 5 mM 2-morpholino- 
propane^sulfonic acid (MPS), pH 7.5, I 
mM EDTA, and 1 mM NlaN 3 was mixed 
(3:1) with a solution of 10 percent polyeth- 
ylene glycol (PEG)- 1000, 0 j M potassium 
phosphate buffer, pH 7 J, and equilibrated 
against 15. percent PEG- 1000, 0.1 M potas- 
sium phosphate, pi 1 7.5. Crystals grew with- 
. in 1 week to a final sire of 0.8 by 0.5 by 0.2 
mm 1 at room temperature. Fresh crystals 
diffract to about 3.2 A resolution, bur their 
diffraction quality decreases rapidly in the 
x-ray beam. The crystals belong to the or- 
thorhombic space group P2 l 2 l 2 l and con- 
tain one proteasome in the asymmetric 
unit. The unit cell constants are a — 
311.9 A, b - 209.0 A, and c = 117.2 A 
(Table 1). In that the proteasomes showed 
sevenfold symmetry when observed by elec- 
tron microscopy (18), we expected seven- 
fold noncxystallographic symmetry. This 
was indeed shown by a self-rotation func- 
tion (17), and in addition seven noncrys- 
tallographic dyads were found perpendicu- 
lar to the sevenfold axis, which was tilted by 
6.4° with respect to the crystallographic a 
axis. The structure was solved by single 
isomorphous replacement (SIR) with the 
use of noncrystallographic symmetry. The 
large number of subunits and heavy atom 
binding sites in the asymmetric unit made 
the localization of the heavy atom positions 
difficult! In addition, most crystals showed 
significant non isomorphism which was not 
indicated by cell constant changes bur he- 
came evident by comparison of diffraction 
intensities. As an initial test of the useful- 
ness of derivatives, a self-rotation function 

533 



BEST AVAILABLE COPY 



From (613) 991-5695 



Order # 0597535 5DP04961964 Mori 23 Dan 2006 08:34:24 PM EST 



Page 3 of 8 



was calculated for the difference Patterson 
maps and analyzed for sevenfold symmetry. 
We found some promising compounds, in- 
cluding Ta^ (TABR) (19), Pt-terpyri- 
dinium-chlorkle (20), and Cs[OsCl 3 (CO) 3 l 
(OSCO), but further interpret arion by 
vector- verification mediods was not suc- 
cessful. Another derivative, Na^HSiWgO^ 
(W9) (21), showed only two sites in the 
analysis of the Harker sections, but was not 
sufficient for stmcfcjre determination. To 
overcome the problem of nonisomorphism, 
we collected iiauve aiul derivative data sets 
at 7 A resolution by soaking the same crystal 
in the .TABR solution and measuring it 
again. The self-rotation function of the re- 
sulting difference Patterson map showed a 
very clear sevenfold symmetry. 

The positioning of the local symmetry- 



Telar.ed heavy atoms was achieved by con- 
secutive rotational and rranslational search- 
es. Using the self-vectors, we searched for a 
heptagonal set of heavy atom sites centered 
at the origin and oriented as revealed by the 
self-rotation function. Wc then made a 
translattonal search in the cross-vectors for 
that heptagonal set. Hach rranslational 
search was performed in two dimensions on 
the Harker sections, which gave consistent 
coordinates for the heptagon of heavy at- 
oms and determined the local symmetry 
operator. Vector verification with this local 
symmetry operator revealed no further 
heavy atom positions. The seven sites were 
refined independently without anomalous 
differences. The initial SIR-phased electron 
density map at 7 A was not interpretable, 
but difference Fourier maps of two other 




Fig. 1 . Stereo view of the original SfR phased electron density at 3;5 A resolution, 1 4-fold averaged. The 
map is contoured at 1a above background and shows residues 123 to 143 of the p subunit. Superim- 
posed is the current mode! at 3.4 A resolution after refinement. 
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Fig. Z Ribbon drawing of (A) the o subunit; a helices are labeled by H, 0 strands by S. The NH 2 -terminal 
helix HO represents the front side of the whole proteasome. The identically oriented subunit within the 
complex is indicated (shown as a yellow sphere) at the top left comer; each subunit is represented by a 
sphere around its center of gravity, (B) the (J subunit in an according orientation (drawings made with 
MOLSGRIPT (48) and RASTER30 (49). 



derivative data sets, W9 and OSCO, clearly 
showed heavy atom sites. Those derivatives 
were used to determine the handedness of 
the TABR phases, but did not improve the 
electron density and were omitted. We as- 
sumed that the heptagon of TABR sites 
represented the central plane of the protea- 
some, thus satisfying the 7 2 -point symmetry 
and constructed a cylindrical envelope of 
154 A in length and 11 5 A in diameter, the 
largest possible dimension without overlaps 
from neighboring molecules. The local sym- 
metry axis was refined by a grid search, 
wherein we varied the orientation or trans- 
lation. Tins grid search was followed by 
three steps of cyclic averaging about the 
sevenfold axis on a 4 A grid monitoring the 
twofold symmetry correlation in the result- 
ing election density map. After conver- 
gence, the averaged map was used to calcu- 
late a new envelope by summing density in 
spheres of 20 A in diameter o with an appro- 
priate. cut-ofT. Starring at 7 A resolution, 26 
steps of averaging around the sevenfold axis 
and consecutive phase extension to 4.5 A 
resolution were calculated on a 2 A grid. At 
this stage, a further refinement step of the 
local symmetry axis was performed as de- 
scribed above with a 1 A grid and phases ar 
4.5 A resolution. The resulting parameters 
of the sevenfold local symmetry axis were 
sufficient to extend phases from 4.5 to 3.5 
A in 23 steps. The resulting twofold sym- 
metry had a correlation of 80 percent of the 
identity operation, and the electron density 
map was averaged 14-fold for final phase 
improvement- The resulting electron densi- 
ty was of very high quality, and the com- 
plete model could easily be fitted except for 
the NH 2 -tenninai 12 amino acids of the a 
subunit, which are not visible in the elec- 
tron density (Fig. 1 and Table 1). 

Structures of the subunits. The core of 
the a subunit is a sandwich of two five- 
stranded antiparallel p sheets with topology 
S8,S1, S2, S9. and S10 in the upper P sheet 
and S?, S6, S5, S4, and S3 in the lower 0 
sheet, respectively {Fig. 2A). These have a 
left-handed twist of about 50° and 80° be- . 
tween the terminal strands. The strand di- 
rections of the two p sheets enclose an 
angle of about 30°. The p sandwich is open 
at one side, where it is decorated with the ; 
NH r terminai a helix HO, which fills the . 
cleft. The opposite side of the p sandwich is . 
closed by four hairpin loops connecting P ■ 
strands S1-S2, S9-S10, S5-S6, and S3-S4, .'■ 
respectively. The upper and lower p sheets • 
of the sandwich are linked by an extended $ 
loop between S2 aivd S3, which forms a j 
main contact site to die neighboring « | 
subunit via H3 and S8 and the loop be-.;f 
tween S7 and S8. The p sandwich isJ| 
flanked by the a helices H3, H4, and H5 on^j 
top and by HI and H2 at the bottom, which*;! 
are antiparallel and tilted by 45° relative to A 
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*ach other. Residues Met 1 to He 12 were not 
divisible in the electron density and were 
; therefore omitted from the model. Residues 
S.i80 to 208 have weaker than average elec- 
tron density and the detailed backbone con- 
rfonnation may be uncertain in some places, 
rprotcasomal a subunits contain a putative 
nuclear localization sequence (NLS) and the 
^complementary NLS (22), which have been 
proposed to interact with each other in a 
. ; phcKphorylation-dependent manner. These 
! signals are conserved in the 'I "hermoplasma 
•proteasome (23). The NLS sequence com- 
prising residues 49 to 56 is situated at the 
end of S2, and the following extended loop 
and is accessible from the solvent. This is 
also true for a putative complementary NLS 
(cNLS), which is located at residue 201 to 
206 in the loop connecting H4 and S9. 

In the course of our model building, the 
structures of the ot and p suhunits emerged 
. to be strikingly similar. Hie Ca atoms of 
the two suhunits can be superimposed with 
a root-mcan-square (rms) deviation of 1 .3.3 
A (Fig, 3B). The structure of the £ subunit 
(Fig. 2R) deviates predominantly in its 
Nll r terminal pan from the « suhtmit, 
where it lacks residues 1 to 35. The 
. polypeptide chain of the P subunit starts 
with strand SI. The absence of a helix HO 
provides access to the interior of the fi 
sandwicli. The other major differences in 
die folding of the 3 subunit comprise the 
extended loop region between S2 and S3, 
the loop between H2 and S5, where the oc 
subunit contains an insertion of four resi- 
dues, and the OOOH-tenninal helix H5, 
which is more extended and followed by an 
additional loop in the p subunit. All 203 
residues of the P subunit are well defined in 
die electron density. The a and P subunits 
appear to represent a novel fold, although or. 
helices covering p sheets are a frequently 
occurring structural motif. 

The association within the ci and P rings 
is rather close, such that 24 and 17 percent 
of the surface of the a and P subunits, re- 
spectively, axe covered. In particular, HI 
from one a subunit forms a contact to the 
H2 helix of the neighboring a subunit The 
connecting loop between H2 and S5 pro- 
trudes into die channel of die heptagon of ot 
subunits and contacts the symmetry-related 
loops of rhe neighboring tx subunits on both 
sides, generating a narrow constriction. The 
loop preceding HO is situated in close prox- 
imity to rhis site and contacts the HO helix 
of the next a subunit. A further contact 
between a subunits is established by the loop 
region between S4 and HI of one subunit to 
. S7 of rhe next submit The p subunits form 
a heptagon with similar contacting segments 
, as described for the a subunits except for the 
contacts established by the HO helix of a, 
■ which is missing in the p subunit The major 
contact between a and P rings is formed by 



die H! and H2 a helices (Hg- 4A). The 
wedge-shaped HI, H2 substructure pene- 
trates an opening between subunits of the 
other ring. This interaction is quasi-symmet- 
rical by homologous segments of a and p. 
The buried surface is 10 percent and 1 1 
percent of each of the a and p rings. The 
dyad related contacts between the P rings 
covct 14 percent of the surface of one P ring. 
They comprise mainly dyad-related segments 
in H3 and H4 and the loops between H4 and 
S9 and between S2 and S3, which contact 
their symmetry mate. 

General architecture and inhibitor bind- 
ing. The overall shape of the 20S proteasome 



is an elongated cylinder liaving a central 
penetrating channel with large cavities and 
narrow constrictions (Fig. 4, B and D). The 
overall dimensions of the complex are 148 A 
in length and 113 A in diameter. As would 
be expected from previous immuno-electron 
microscopy studies (24), rhe a subunits are 
located at the ends, whereas the P subunits 
form the two equatorial rings. As also re- 
vealed by electron tomography (25), the 
central channel has three large cavities. Two 
of them are located at the interface between 
the a and the p rings, the third is formed by 
the p rings in the center of the molecule. 
The maximum diameter of this cavity is 53 



Table 1. Collection of data and statistics of structure determination and refinement Initial collection of 
native data sets and the screening for heavy atom derivatives was don© in house at 1 4°C with a 180-mm 
Mar research imaging plate detector (4 1) mounted on a Rgaku rotating anode x-ray generator operated 
at 4.8 kW {\ — CuKa = 1 .5418 A). Graphrte-rricroohrc^tb^ x-rays were coflimated to 0,5 mm in 
diameter, the crystal to detector eSstance was 350 mm to separate the spots In the a* direction. 
Absorption at this distance was reduced wilh the use of a helium chamber. Diffraction data of the NAT! 
1 and the corresponding TABR derivative crystal were collected to 7 A resolution. The TABR-crystal was 
soaked for 1 hour in 1 rnMTABR dissolved in 15 percent PEG -1000. 15 percent propan-2-ol. and 0.1 M 
potassium phosphate. pH 7.5. The final data set to 3.4 A resolution was collected at the BW6-beamline 
at DESYAHamburg (X «= 1.0 A) from six different crystals. Diffraction data were collected with a 300-mm 
Mar research imaging plate at a distance of 430 mm. The total number of measured reflections in the 
merged data set was 793,206, with 90,045 unique reflections between 10 and 3.4 A resolution. AH x-ray 
intensities were evaluated with the MOSFLM program package version 5.2 {42). Scaling and merging 
were done with CCP4 (43), difference Patterson maps were calculated and interpreted with PROTEIN 
(44), and real space averaging was performed with MAIN (45). The initial mode! was built at 3.5 A 
resolution with FRODO (44) on an ESV-30 Graphic system work station {Evans & Sutherland, Salt Lake 
City. Utah). The model was completely fitted in the first density and refined by energy and 14-foJd 
noncrystallographic symmetry restrained crystaitographic refinement with X-PLOR (45), with parameters 
as described by Engh and Huber (47). After refinement. 2F 0 - F c Fourier maps were calculated, and the 
model was checked and rebuilt. The resolution was extended from 3.5 to 3.4 A during the refinement 
procedure. Individual 8 factors were refined with the use of 14-fold noncrystallographic restraints and 
restraints between bonded atoms. The current model has been refined to a crystallogmphic & factor of 
22.1 percent, with data from 10 to 3.4 A resolution, with /At, > 2.0 (86,922 unique reflections) and 
comprises 45,892 nonhydrogen atoms rnaintaintng strict geometry with deviations from ideal values as 
shown In the table. The model comprises residues 13 to 233 ol the 233 amino acids of the « subunit and 
afl the 203 residues of the p subunit. 
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77^: - Fpl/SFp^ where F PH and F p are the derivative and the native structure factor amplitudes, 

respectively. tPhasino, power: Fj/residuat: rms moan heavy atom caUributiort/rriis residual, defined as: {(Fp,.^ 
F„ J/rV } w with the sum over all reflections, wtiere Fp^^ is the calculated structure tactor and F ^ , It the structure (actor 
amp&tudo of the heavy atom derivative, respectively. tfln^S^I/^-CWll^^ 
value ot the «'-th rneasurement of h and if^ Is the corresponding mean value of n lor all i measurements of h; the 
summation runs over all measurements. STrw rms deviation m bond lengths and bond angles from rdeofl 
values. ffhe rms deviation between roncrystcffSographic symmetry related atom positions. TThenrrs deviation 
in temperature factors between bonded and rroncrystaEograptw; symmetry related atoms. 
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A. Access to these compartments is con- 
trolled by four narrow gates, which are. 
formed by the loops connecting H2 and S5 
in both subunits. In the a subunit, the loop 
Tyr l2t \ Giy U7 , and Gly l?ft protrudes into the 
channel and fonus a hydrophobic ring 
around the channel, leaving a passage of 
only 13 A in diameter at its entrance. The 
homologous loops of the p subunits form a 
bottleneck of 22 A in diameter in the central 
part of the complex, ringed by Tyr 95 and 
Met 96 residues, which protrude into the 
channel (Fig. 5A). 

For inhibitor binding studies, a protca- 
some crystal was soaked for 36 hours with 
acetyl-Leu-Leu-norlcucinal [LLnL, calpain 
inhibitor I (Bachem)] a potent inhibitor of 
the proteasome's Suc-Len-Leu-VaUTyr- 
AMC cleaving activity (26). This peptide 
aldehyde is an inhibitor of cysteine proteases, 
such as cathepsin B (27). The unavcraged 
difference electron density at 4 A resolution 
shows die tjvpeptide bound to all 14 P sub- 
units. The tripeptide is situated with its al- 
dehyde function at the NH r tcrnunal threo- 
nine of the 0 stihunit in a hydropliobic cleft 
inside the central cavity of the complex (Fig. 
5). The three amino acids of the inhibitor 
(II to 13) are in extended conformation, and 
form hydrogen bonds from 12 to Thr 21 on 
one side and from 13 to Gly 47 and from 11 to 
Val 49 on the other side. The bound inhibitor 
peptide thus extends the short P strand SI of 
the P subunit. The active site flanking. resi- 
dues 20, 21, and 22 belong to the extended 
loop between S2 and S3, and residues 46, 47, 
and 48 form the loop between S4 and HI. 
The hydrophobic cleft is formed by He 45 , 
Ala 46 , Leu 48 , and Val 49 on one side and by 
Val 20 , Thr 21 , and Met 22 on the orher .side. In 
addition, residues 112 to 120 of the neigh- 
boring p subunit come close to the active 
site, and Asp 114 may interact with the N- 
acetyl group of the inhibitor, but this possi- 
bility is not resolved by rhe present electron 
density data. Residues Lys 33 and Glu 17 form 
a salt bridge and are in close proximity to 
Thr 1 , The difference electron density shows 
no conformational changes in the protein on 
binding the inhibitor. 

In accordance with the lack of sequence 
similarity to other proteins, the a and p 
subunits appear to exhibit a novel topology. 
The folding of the two subunits is strikingly 
similar, despite the insignificant amino acid 
sequence similarity (Fig. 3 A): Differences 
are seen at the NH r and CXX)H-termini, 
notably the NH r termina! ct helix HO of 
the a subunit, and in loop regions (in par- 
ticular at the S2-S3 and the 1 12-S? loop). 
Eukaryotic proteasomal subunits related to 
the archaelwcrerial a subunit display highly 
conserved NH r terrainal regions (23, 28). 
This a helix may be crucial for the interac- 
tion with regulatory complexes such as die 
195 complex. The helix is situated near the 

>S36 



entrance gate to the channel, the preceding 
12 amino acid residues, which have no de- 
fined electron density, are disordered so that 
their precise role is uncertain. The a sub- 
units assemble spontaneously to form seven- 
mcmbered rings, whereas p subunits do not. 
Since deletion of the first: 35 amino acid 
residues in the a subunit abolishes rhis ahil- 
ity, the NH 2 -terminal extension appears to 
have a critical role in assembly (28). The 
predicted position of HO within rhe chan- 
nel (29) is at variance with the observed 
structure. The four protein disks stack to 
fonn tight contacts between the oc/p and 
3/P rings, respectively. The ot/p contact is 
established mainly by the a helices HI and 
H2, and rhe p/P contact is formed by a 
helices H3, H4, and loops H4-S9 and S2-S3. 
The p/P and ot/P double disks have strict 
and approximate dyad symmetry, respective- 



ly. The tight packing appears to make the 
particle impenetrable from the outer walls of 
rhe cylinder (Fig. 4C), allowing access only 
through the central channel (Fig. 4, B and 
D). 'Die more constricted outer gates are 
formed by the loop between H2 and S5 of 
the a subunit, which is more extended than 
in the p subunit and allows rhe passage of 
extended polypeptide chains only. The in- 
ner gates are formed by equivalent segments 
of the P subunits. The outer and inner gates 
control the exchange between solvent and 
the proteasome and between outer and in- 
ner proteasomal cavities, respectively. The 
conserved tyrosine residues of those looi>s 
which protrude into the channel may play a 
crucial role in strucutring the gates and in 
assisting the passage of substrates and prod- 
ucts. The unfolded state of the substrate 
may be maintained by the hydropholnc en- 
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Fig. 3. (A) Sequence alignment of the o and of the 0 subunit [BEST FIT, UWGCG package (50)1 with a ; 
sequence identity of 26 percent. The secondary structure deduced (ram the refined model calculated 
with DSSP (57), is superimposed. Abbreviations tor trie amino add residues are A, Ala; C, Cys; D, Asp; ; 
E, Glu; H, Phe; G, Gry; H, l-fe; I, fle; K, Lys; L, Leu, M. Met; N, Asn; P, Pro; Q. On; R, Arg; S, Ser; T. frv; = 
V, Val; W, Trp; and Y, Tyr. (B) The backbone of ttie two subunits superimposed wtth an rms deviation of ;l 
1 .33 A with 84 percent of tfie afigned sequence (GRASP) (52). The rms fit was done with MAIN [4% 
including residues a35^i52/p1-p18. a66-at23/p33^90. a l3O-a142/p93-pi06, ^145^202/^ 107- 1 
p164, and a2O9-a233/0l 96. The orientatiori Of Ux? molecules is Ihe senie as in Fig. 2. | 
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||y,iioncncnt of the channel's interior. 

sf§£ The 20S proteasome has some features 

^tO' common with the bacterial chaperon in 



GroEL (30). The x-ray stmcturc of GroEL 
(31) allows a more detailed comparison of 
the two proteins. GroEL is an 850-kD com- 
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■: fig. 4. (A) Ca drawing of two a (red) and two & subunits (blue) showing the major contacts between the 
^subunits. (B) Top view of the 20S proteasome showing Ca atoms only. (Q Side view of the 20S pro- 
:: teasome. The overall cfimensions are 148 A in length. 113. A maximum diameter and 75 A minimum 
^xSameter. (D) View of the proteasome cut open along the sevenfold axis; the three compartjnents and 
pSte gates are deailyvts&te. Dimensions; entrance to Ihe channel: <f> 13 A bottlenecks in the <»ntcn 4> 22 
| A left and right cavity: $ 50 A and length 40 A, central cavfty: <J>53 Aand length 38 A. 
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plcx consisting of one 14-fold repeating 
subunit satisfying 72 point group symmetry, 
and participates in ATP-dependent poly- 
^eejSfde: chain folding. Indeed both protein 
complexes have a central channel through 
ilie whole molecule. However, while, the 
interior is divided into three distinct com- 
partments and access to these compartments 
is controlled by narrow channels m ttic pro- 
teasome, the GroEL channel is more or less 
continuous. GroEL has relatively large side 
windows, while the proteasome is tightly 
sealed. Moreover, there is no similarity in 
the tertiary structure of the subunils. 

Although the LLnL inhibitor model was 
fitted to a 4 A resolution map, its position 
and conformation is well defined by elect ton 
density and stereochemical considerations 
except for die 11 and 12 side chains, which 
are solvent exposed and may he mobile (Fig. 
5C). The inhibitor is bound in a 0-sheet 
conformation between residues 46 to 48 and 
20 to 22, which belong to the extended loop 
between S2 and S3. This loop is stabilized by 
five main contacts of which four are contacts 
to symmetry-related 0 subunits. This may 
explain why monomeric processed P sub- 
units arc not active (28). The aldehyde func- 
tion of the LLnL mhibrcor is situated close to 
ihe side chain of Thr 1 of the ($ subunit:, and 
may form a liemiacetal with the hydroxy I 
group of Thr 1 . In conjunction with the site 
directed mutagenesis experiments descrihed 
by Seemuller etd, (i3), this suggests that the 
Thr'Oy acts as die nucleophile in peptide 
hydrolysis. Close to it is Lys 33 N£ ami Glu 17 
Oe We propose that together with the NH 2 - 
terminal amino group, these residues fonn a 
catalytic tetrad. In the prototypical serine 
proteases, the alcoholic function of a Ser 
residue is activated by a neighboring His, 
which acts as proton acceptor (32). Analo- 
gous functions could be provided in the pro- 
teasome by Thr 1 Oy and Lys i3 N€ or Thr 1 N; 
Glu 17 orients Lys 33 in analogy to the role of 
Asp 1 99 in chymotrypsin. As Lys" is prohahly 
protonatcd and unlikely to act as proton 
acceptor, we propose the Thr 1 amino group 
for this role. Thr 1 Oy-H • * *-N may form a 
five-raembered ring structure in favorable 
hydrogen bonding geometry. The positively 
charged Lys 33 could further lower die pK a of 
Thr 1 N by electrostatic effects. The carbonyl 
groups of Ala 168 and An* 19 fonn a hydrogen 
bond to the NH 2 -tcrminus, thus orienting 
and polarizing it. The oxyanion of the tetra- 
hedral transition state, which develops in 
the acytation reaction may be stabilized by 
the main chain amkJe of Gly 47 , which points 
toward the aldehyde group of the bound 
inhibitor. 

The substrate specificity of the archae- 
bacterial 20S proteasome is not yet clearly 
defined, hut cleavage seems to be rather 
unspecific ( /4). Indeed, die structure of the 
complex shows a large pocket for the PI 
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norleucine, mainly because of Gly 31 . Other 
residues in contact with the norleucine at 
PI arc mainly hydrophobic as described 
ahove. The SI pocket is open toward the 
solvent and may accept side chains of dif- 
ferent size and charge- The Leu P2 side 



chain is not in direct contact with the 
protein and is probably unrestricted in size. 
Our study provides no information on pos- 
sible binding sites beyond P3 f and longer 
peptides may be relatively tree to move. 
This may be iraponant in explaining the 




Fig. 5. (A) F 0 - F c difference Fourier map of the Ac-Leu-Leu-ncrteudnal inhibitor (LLnL) at 4 A resc4ution. 
Superimposed Is the ourrent model of the p subunit. FourteenfoW averaged difference electron density 
showing seven inhtottors sitting around the central cavity formed by 0 subunits. Distances between active 
sites within one (3 ring are 28 A, 51 A, and 64 A. (B) Stereo drawing of the accessibte surface of the central 
cavity built by two rings of p subunits cut open along the sevenfold axis. Superimposed is a space-filling 
model of the LLnL inhibitors (GRASP) (52). (C) Stereo drawing of the active site with a 14-fold averaged 
F 0 - F c map. Superimposed is the nonrefined inhibitor model (MAIN) (45). Data collection of the inhibitor 
data set: a proteasome crystal was soaked in 5 mM LLnL dissolved in 15 percent PEG- 1000, 15 
percent propan-2-oI. 0.1 M potassium phosphate, pH 7.5. for 36 hours. A complete data set to 4 A 
resolution was collected and evaluated as described in Table 1 for NATI with a 300-mm imaging p4ate 
[Mar researcher)]. 
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proteasome's ability to generate peptides of 
defined length. In eukaryotes, these pep- 
rides are exported to the cell surface and 
presented by MHC class I molecules to 
initiate the T cell immune response (33). 
The Thermoplasrna proteasome also shows a 
distribution of peptide products centered at 
hepta- or octa peptides. In line with previ- 
ous considerations, wc suggest that this is 
achieved by the spatial distribution of the 
active sites (14). If the acylenzyme interme- 
diate generated in the initial proteolytic 
reaction has sufficient lifetime, the bound 
substrate might pass through the central 
cavity until it meets 1 of the other 13 
symmetry related active sites. The shortest 
distance between two active sites is 28 A, 
which may be spanned by a hepta- or oc- 
tapeptide in extended conformation. Long- 
er peptides may be generated by two active 
sites at a wider distance, shorter ones could 
he due to premature dcacylation, lebihding, 
and processing. A relatively stable acylen- 
zyme intermediate could also explain the 
complete sequential degradation of proteins 
by eukaryotic proteasomes (34) as the resid- 
ual polypeptide chain remains bound to an 
active site of the proteasome. Our model 
also predicts an oligo(octa)peptidyl car- 
boxypeptidase activity. It is obvious that 
the inhibitor peptide cannot be continued 
in regular conformation in the P' direction 
(Fig. 5, A and B); therefore, it may turn 
away from the protein where it would have 
freedom to move or it may have binding 
sites on the protein surface. Data with suit- 
able analog inhibitors should provide an- 
swers to these possibilities. These structural 
features explain the lack of specificity of the 
proteolytic action of the proteasome- The 
implication of lysine and hydroxyl groups is 
not without precedent and has been report- 
ed for the leader peptidase from E. coli (35). 
Lysines and serines in the active sites of 
serine proteases have been classified by 
Rawlings and Barrett (36) and shown to 
occur in a family containing the serine-type 
O-Ala-D-Ala carboxypeptidase and P lacta- 
mase by crystal structures. The latter en- 
zyme opens 0-lactam rings, which mimic 
peptide bonds. The active site of P lacta- 
mase from Staphylococcus aureus consisting 
of Ser 70 , Lys 7 \ and GW 66 (37) shows a 
geometry similar to that of Thr\ Lys 33 , and 
Glu 17 in the proteasome although in a to- 
tally different structural context. The D- % 
Ala-D-Ala carboxypeptidase catalyzes the. 
crosslinking of peptidoglycans in bacteriai| 
cell walk by tianspeptidatiun. These,: 
transpeptidases contain lysine, serine, atvk 
glutamatc in their active sites (38). The|? 
transpeptidase reaction requires a lon*K; 
lived acylenzyme intermediate (39), possi?: 
bly also imporranr for the proteasome. The* 
structure of a penicillin acylase suggests th<| 
NH 2 -cenninal serine residue as the catalytfc 
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ttiucleophile similar to the NH 2 -tenninal 
/ihteonine of the proteasome (40). 
\, * The TTiermnptasnia 20S pioteasome close- 
ly resembles in size, shape, and nmctional 
properties die eukaryotic 20S proteasome. 
ftjj subunirs of the eukaryotic proteasomes 
are related by amino acid sequence similarity 
to the a or p subunit of the T. addopMum 
proteasome, suggesting a very similar catalyt- 
ic mechanism (13). The much simpler sub- 
unit composition and internal symmetry of 
the archaehacterial proteasome was the key 
to structure determination, which now pro- 
vides a detailed model for the elucidation of 
the suueutre and biochemical roles of eu- 
karyotic proteasomes. 

REFERENCES AND NOTES 



1. J. M. Peters. Tmnds Biochem. Sd. 19. 377 (1994): 
A. .1. Rh/Htt. Biochem. J. 291, 1 (1003); A. L GoW 
berg. Eur. J. Bkxham. 203, 9 (1902); M. Rcctv 
steiner. L Hoffman, W. Dubiet, J. Biof. Oiem. 268, 
60 (1993); A. l tershko and A. aechanover, Amu. 
to.fifacf>em61,76l(1992). . 

. 2. V. J. Pabmbefla. 0. Rando. A. L Gctdbeirj, T. 
Maniafc. CeH7B,//2 (1994); A. Ctechanovor, &W. 
79, 1 3 (1 994); 8. Ffichter-Ruoff and 0. H. Wolf, FEBS 
tell. 338.34(1993)- 

3. K.LRockcra'.,Ce07e. 7GM1994). 

4. B. OaWmann et ai., FEBS Lett 251 . 1 25 (1 989). 

5. J. M. Peters, 2. Cejka, J. a Harris, J. A. Kteinschmklt, 
W. Baumeister, J. Mot Biol. 234, 93? (1993). 

6. J. M. Peters, W. W. Franks, J. A. Kianschmfcfi, J. 
Bid. Chem. 269. 7709 (1 994); L Hoffman, G. Pratt, 
M. Rechsteiner. 2x1 267. 223R2 (1992); T. Yo- 
shirrajra et at. J. Struct Biof. 1 11 . 200 (1993): G. N. 
DeMartino ef ai.. J. Bio*. Chem. 269, 20887 (1994). 

7. V. MurefcHmi ef ai.. Nature 360. 597 (1992). 

8. J. M. Fagan and L. Waxman. J. Biof. Chem. 267, 
23015(1932): T. Wenzel and W. Baumeister, FEBS 
Leli. 326, 215 (1993); W. Matthews, K. Tanaka. J. 
Driscofl, A Ichihara. A L GoWberg, Ftoc. Natl. 



Acad. Set. U.S.A. 86, 2579 (1989). 

9. G. Puehtar, F. Pitzer. P, ZwicW.W. Baurncister.SKsf. 
Appl. Microbiol. 16. 734 (1994). 

10. P. Zwckl e* a/-, Bochentistry 31, 964 (1992); K. 
Tanaka. T. Taniuia, T. Yosnimura. A. Ichihara, New 
Biol. 4, 173 (1992); W. Heinemeyer. N. Trondle. G. 
Albrecnt, D. M. 12229(1994). 

11. M. Ortowski, C. Cardozo. C. Mictoud, Dbchcmistry 
32, 1563 (1993); H. Djaballah and A J. Rh/ett, ibid. 
31,4133(1992). 

12. E. SaxnuBcr, A. Lupas, r. Zoni. P. ZwlcW. W. 
Baumeister. FEBS Lett. 359, 173(1995). 

13. £ SeemuBer el ai. . Science 268. 579 (1995). 

14. T. Wenzel, C. fcckerskom, h Lottspeich, W. t3au 
metster. FEBS Lett 349. 205 (1994). 

15. M. G. Brwffi, J. DriscuS, J. J. Mofiaco, Nature 353, 
355 (1991); R Glynne ct ai..Md., p. 357; J. Driscofl 
and 0. Finley. Ce* 68, 823 (1992); A L Goldberg 
aiKl K. L. Rock. Nature 357, 375 (1 992). 

16. P. ZwcW. F. Ijottspeich. W. Baumeister. iCUS Lett 
312,157(1992). 

17. a Jap et d/., J. Mol. BioL 234. 881 {1 993). 

18. G.Puhter of ai.. SMBOJ. 11. 1607(1992). 

19. P. H. Kuhn and R. E McCartey. fnorg. Chem 4. 
1482(1965). 

20. H. M. Broihers 3 andN. M. Kostic. Biochemistry 2Q. 
7468 (1990). 

21. R.G. Fmke, a Rapko, R. J. Jaxton, P. J. Domaifle. J. 
Am. Chem. Soc. 108. 2947 (1986) 

22. K. lanakaerai..f-£BSi_srf. 271, 41(1 99(^. 

23. P. ZwicW, F. Uottspeidi. D. DaWrnam, W. Oaurnc'e 
ter, ibid. 278,217(1991). 

24. A. Grziwa. W. Baumeister, B. DaWmann, F. Kopp, 
ibid. 290.186(1991). 

25. R. Hegert, G. Pfeiter, G. Puhler, B. OaMmarm. W. 
Baumetster , ibid. 283. 1 17 (1991). 

26. B. Datilmana L. Kuehn. A Gfidwa, P. ZwicW. W. 
Baumeisler, Eur. J. Biochem. 208, /89 (1992). 

27. T. Sasaki ct ai. t J. Enzyme inhibition 3, 195 (1990). 

28. P. ZwicR J. Kteinz. W. Baumefeter. Natijrn Struct 
Biot. 1.765(1994). 

29. A. Ujpas, A J. Koster, T. Wufe, W. Baumeister, 
FEBS Lett. 354. 45 (1994). 

30. P. ZwickJ et at, J. Stntct. BioL 103. 197 (1990). 

31 . K. Braig et at.. Nature 371 , 578 (1994). 

32. R. Ruber and W. Bodo, Ace. Chem. Hes. 11, 114 
(1978), 

33. L. R. Oickef ai.. J. Immunol. 152, 3884 (1994). 



34. L. R Dick. C. R Moomaw, G. N. DeMartino, C. A 
Slaughter, Biochemistry 30. 2725 (1 991). 

35. W. R Tschantz, M. Sunq, V. M. De^ado-Partin, R. 
E. Datooy. J. Biol. Chem. 268. 27349 (1993). 

36. N. D. Rawltngs and A. J. Barrett, Biochem. J. 290, 
205(1993). 

37. C. C. H. Chen, J. Rani, R F. PratL 0. 1 ierzberg. J. 
AtotBfa/. 234, 165 (1993). 

38. J. A Ketty, J. R Kiiox. H. ZHao, J. Frere, J. Ghuysen. 
ibid. 209, 281 (1989). 

39. J-M. Ghuysen at el., Top. Mol. Pharmacol. 1, 63 
(1981). 

40. H. J. Dugglesby et ai.. Nature 373, 264 (1995). 
4t. 2. Oauter, H. Terry, H. Witeel, K. S. Wteon. Acta 

CrystaSogr. B46. 833 (1990). 

42. A. G. W. Leslie, Joint CCP4 and ESF-EACfdB News- 
fen. Protein Crystatogr. 26 (Daresbury Laboratory, 
Warrington. UK, 1992). 

43. CCP4. Science and Engineering Research Council 
(SERC^ 0)tebcratKreO)rrpiitir^ 

Oawisbiffy Laboratory, Warrington, UK. 1979). 

44. W. Stetgemann, CrystaifograptHC Computing 5, D. 
Moras. A D. Podjamy, J. C. Thierv. Eds. {Oxford 
Univ. Press, Oxford, 1991). 

45. D. Turk, PhD. thesis. Technischc Urwersitfit 
Munchen (1993. 

46. T. A Jones, J. Appi. CrystaOogr. 1 1 , 268 (1978). 

47. A T. Bruenger, X-PLCH Version 3. J: A Sysi&n for 
CrystaBogtaphy and NMR (Yale Univ. Press, New 
hlavcn, CT, 1992); R A Engh and R Huber, Acts 
CrystaSogr. A47. 392 (1991). 

48. P. J. KrauSs, J. Anpt. CrystaSogr. 24. 946 (1991). 

49. £. A. Merritt and M. E. P. Murphy, Acta CrystaSogr. 
O50. 869 (1994). 

50. Genetics Computer Group, Wisconsin. USA (1 991). 

51. W. Kaosch and C Sander, Blopolymers 22. 257/ 
(1983). 

52. A NfchoBs and B. Honig. Cdumbia University 
(1993). 

53. We thank M. Stubbs for critical readtng of the manu- 
script ajva II D. BartunJ< lor heJp v/tth the x-ray data 
cofceforu Supported in part by Fonds der Cternisch 
en IndusUie (D.S.). The coordinates have been de- 
posited in the Protein Date Bank. Brookhaven, NY, 
wrth tracking number 16750 tor the coordinates and 
T5751 tor the observed structure factor amplitudes. 

20 Decemner 1994; accepted 10 March 1995 



AAAS-Newcomb Cleveland Prize 

To Be Awarded for a Report, Research Article, or an Article Published in Science 



The AAAS-Newcomb Cleveland Prize is awarded to the 
author of an outstanding paper published in Science. The value 
of the prize is $5000; the winner also receives a bronze medal. 
The current competition period began with the 3 June 1 994 issue 
and ends with the issue of 26 May 1 995. 

Reports, Research Articles, and Articles that include original 
research data, theories, or syntheses and are fundamental contri- 
butions to basic knowledge or technical achievements of far- 
reaching consequence are eligible for consideration for the 
prize. The paper must be a first-time publication of the author's 
own work. Reference to pertinent earlier work by the authormay 
be included to give perspective. 



Throughout the competition period, readers are invited to 
nominate papers appearing in the Reports, Research Articles, or 
Articles sections. Nominations must be typed, and the following 
information provided: the title of the paper, issue in which it was 
published, author's name, and a brief statement of justification 
for nomination. Nominations should be submitted to the AAAS- 
Newcomb Cleveland Prize, AA AS, Room 924, 1 333 H Street, 
NW, Washington, DC 20005, and must be received on or 
before 30 June 1995. Final selection will rest with a panel of dis- 
tinguished scientists appointed by the editor-in-chief of Science. 

The award will be presented at the 1996 AAAS annual 
meeting, in cases of multiple authorship, the prize will be 
divided equally between or among the authors. 



SCIENCE • VOL2G8 • 26 APRIL 1995 



BEST AVAILABLE COPY 



